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Abstract 

We prove a general lower bound on the average-case complexity of Shellsort: the 
average number of data-movements (and comparisons) made by a p-pass Shellsort for 
any incremental sequence is Q(pn 1+ p) for all p < logn. Using similar arguments, we 
analyze the average-case complexity of several other sorting algorithms. 



Q 

r/5 ; 1 Introduction 

The question of a nontrivial general lower bound (or upper bound) on the average complexity 
of Shellsort (due to D.L. Shell 0) has been open for about four decades ||, [b|] . We present 
such a lower bound for p-pass Shellsort for every p. 

Shellsort sorts a list of n elements in p passes using a sequence of increments h\ , . . . , h p . 
In the kth. pass the main list is divided in hk separate sublists of length n/hk, where the 
ith sublist consists of the elements at positions i mod hk of the main list (i = 1, . . . , hk). 
Every sublist is sorted using a straightforward insertion sort. The efficiency of the method 
is governed by the number of passes p and the selected increment sequence h\, . . . ,h p with 
hp = 1 to ensure sortedness of the final list. The original logn-pass increment sequence Q 
[n/2_|, [n/4j, . . . , 1 of Shell 14] uses worst case 0(n 2 ) time, but Papernov and Stasevitch 
H showed that another related sequence uses 0(re 3//2 ) and Pratt [11| extended this to a 
class of all nearly geometric increment sequences and proved this bound was tight. The 
currently best asymptotic method was found by Pratt |pH| . It uses all log 2 ra increments 
of the form 2*3 J < [n/2\ to obtain time 0(n log 2 n) in the worst case. Moreover, since 
every pass takes at least n steps, the average complexity using Pratt's increment sequence 
is 0(nlog 2 n). Incerpi and Sedgewick Q constructed a family of increment sequences for 

which Shellsort runs in time using (8/e 2 )logn passes, for every e > 0. B. 

* Supported in part by the NSERC Research Grant OGP0046613 and a CITO grant. Address: De- 
partment of Computing and Software, McMaster University, Hamilton, Ont L8S 4K1, Canada. Email: 
jiang@cas.mcmaster.ca 

Supported in part by the NSERC Research Grant OGP0046506, a CITO grant, and the Steacie Fel- 
lowship. Address: Department of Computer Science, University of Waterloo, Waterloo, Ont. N2L 3G1, 
Canada. E-mail: mli@math.uwaterloo.ca 

•'Partially supported by the European Union through NeuroCOLT II ESPRIT Working Group. Address: 
CWI, Kruislaan 413, 1098 SJ Amsterdam, The Netherlands. Email: paulv@cwi.nl 
1 "log" denotes the binary logarithm and "In" denotes the natural logarithm. 



1 



Chazelle (attribution in p2| ) obtained the same result by generalizing V. Pratt's method: 
instead of using 2 and 3 to construct the increment sequence use a and (a + 1) for fixed 
a which yields a worst-case running time of n log 2 n(a 2 / In 2 a) which is 0(n 1+e /V lo s n ) f or 
In 2 a = O(logn). Plaxton, Poonen and Suel [ 10 1 proved an Q(n 1+e ^^) lower bound for 



p passes of Shellsort using any increment sequence, for some e > 0; taking p = f2(logra) 
shows that the Incerpi-Sedgewick / Chazelle bounds are optimal for small p and taking p 
slightly larger shows a 0(nlog 2 n/(loglogn) 2 ) lower bound on the worst case complexity 
of Shellsort. Since every pass takes at least n steps this shows an Q(n log 2 n/(log logn) 2 ) 
lower bound on the worst-case of every Shellsort increment sequence. For the average-case 
running time Knuth || shows 0(n 5 / 3 ) for the best choice of increments in p = 2 passes 
and Yao [|l(| derives an expression for the average case for p = 3 that doesn't result in a 
comparable asymptotic analytic bound. Apart from this no nontrivial results are known for 



the average case; see ||, 12, 13 1 



Results: We show a general £l(pn 1+1 / p ) lower bound on the average-case running time 
of p-pass Shellsort under uniform distribution of input permutations for p < logn. For 
p > logn the lower bound is trivially £l(pn). This is the first advance on the problem of 



determining general nontrivial bounds on the average-case running time of Shellsort 11, 



l|, |, 0, H, 01 . Using the same simple method, we also obtain results on the average 
number of stacks or queues (sequential or parallel) required for sorting under the uniform 
distribution on input permutations. These problems have been studied before by Knuth || 



and Tarjan [15| for the worst case. 



Kolmogorov complexity and the Incompressibility Method: The technical tool to 
obtain our results is the incompressibility method. This method is especially suited for the 
average case analysis of algorithms and machine models, whereas average-case analysis is 
usually more difficult than worst-case analysis using more traditional methods. A survey 
of the use of the incompressibility method is [Q] Chapter 6, and recent work is Q. The 
most spectacular successes of the method occur in the computational complexity analysis 
of algorithms. 

Informally, the Kolmogorov complexity C{x) of a binary string x is the length of the 
shortest binary program (for a fixed reference universal machine) that prints x as its only 
output and then halts ||. A string x is incompressible if C{x) is at least \x\, the approxi- 
mate length of a program that simply includes all of x literally. Similarly, the conditional 
Kolmogorov complexity of x with respect to y, denoted by C(x\y), is the length of the 
shortest program that, with extra information y, prints x. And a string x is incompressible 
relative to y if C(x\y) is large in the appropriate sense. For details see Here we use that, 
both absolutely and relative to any fixed string y, there are incompressible strings of every 
length, and that most strings are nearly incompressible, by any standard. Another easy 
one is that significantly long subwords of an incompressible string are themselves nearly 
incompressible by any standard, even relative to the rest of the string. In the sequel we 
use the following easy facts (sometimes only implicitly). 

2 By a simple counting argument one can show that whereas some strings can be enormously compressed, 
like strings of the form 11 ... 1, the majority of strings can hardly be compressed at all. For every n there 
are 2 n binary strings of length n, but only ^2™^q 2 ! = 2™ — 1 possible shorter descriptions. Therefore, there 
is at least one binary string x of length n such that C{x) > n. Similarly, for every length n and any binary 
string y, there is a binary string x of length n such that C(x\y) > n. 

^Strings that are incompressible are patternless, since a pattern could be used to reduce the description 
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Lemma 1 Let c be a positive integer. For every fixed y, every finite set A contains at least 
(1 - 2~ C )|A| + 1 elements x with C(x\A,y) > [log |^4| J - c. 

Lemma 2 If A is a set, then for every y every element x £ A has complexity C(x\A, y) < 
log|A| + 0(l). 

The first lemma is proved by simple counting. The second lemma holds since x can be 
described by first describing A in 0(1) bits and then giving the index of x in the enumeration 
order of A. 



2 Shellsort 

A Shellsort computation consists of a sequence comparison and inversion (swapping) opera- 
tions. In this analysis of the average-case lower bound we count just the total number of data 
movements (here inversions) executed. The same bound holds for number of comparisons 
automatically. 

Theorem 1 A lower bound on the average number of inversions in a p-pass Shellsort with 
p < logn is (pn 1+ p^j. 

Proof. Let the list to be sorted consist of a permutation tt of the elements 1, . .. ,n. 
Consider a (hi, . . . , h p ) Shellsort where is the increment in the fcth pass and h p = 1. 
We assume that p < logn. For any 1 < i < n and 1 < k < p, let ma- be the number 
of elements in the h^-chain containing element i that are to the left of i at the beginning 
of pass k and are larger than i. Observe that Ya=i m i,k is the number of inversions in 
the initial permutation of pass k, and that the insertion sort in pass k requires precisely 
YA=i( m i,k + 1) comparisons. Let M denote the total number of inversions: 

n.p 

M := £ m itk . (1) 

i,k=l 



Claim 1 Given all the 's in an appropriate fixed order, we can reconstruct the original 
permutation tt. 

Proof. The mj p 's trivially specify the initial permutation of pass p. In general, given 
the mi fe's and the final permutation of pass k, we can easily reconstruct the initial permu- 
tation of pass k. □ 

Let M as in ([!]) be a fixed number. Let permutation it be a random permutation having 
Kolmogorov complexity 

C(ir\n,p, P) > logn! — logn. (2) 

length. Intuitively, we think of such patternless sequences as being random, and we use "random sequence" 
synonymously with "incompressible sequence." It is possible to give a rigorous formalization of the intuitive 
notion of a random sequence as a sequence that passes all effective tests for randomness, see for example M. 
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where P is the encoding program in the following discussion. The description in Claim [T] is 
effective and therefore its minimum length must exceed the complexity of it: 

C(m 1A ,...,m n:P \n,p,P) > C(n\n,p,P). (3) 

Any M as defined by ([!]) such that every division of M in rein's contradicts (|J) would be 
a lower bound on the number of inversions performed. There are 



i=i 



possible divisions of M into np nonnegative integral summands rein's. Every division can 
be indicated by its index j in an enumeration of these divisions. Therefore, a self-delimiting 
description of M followed by a description of j effectively describes the raja's. The length 
of this description must by definition exceed the length of the minimal effective description 
(the Kolmogorov complexity). That is, 

logL>(M) +logM + 21oglogM > C(mi,i, . . . , m n , p \n,p, P) + O(l). 

We know that M < pn 2 since every < n. We also don't need to consider p = f2(n). 
Together with (|2|) and (H), we have 

logD(Af) > logn! -41ogn + 0(l). (5) 

Case 1: Let M < np - 1. Then 

log D(M) < log i^A =M-UogM. 

Using (|5|) we find M = (re logn) and p = fi(logre). 

Case 2: Let M > np. Then by ((f) D(M) is bounded above by | 

, (M + np-l\ . . , M + rep-1 , rl M + np-1 
log = (np— ljlog hMlog- 



np — I J np — 1 M 

1, M + np-1 
+ 2 1 ° g („p-l)M + °' 1 >- 



The second term in the right-hand side equals 

log(l + ^) M -loge^- 1 

for n — ► oo. The third term in the righthand side goes to for n — ► oo. Therefore, the total 
right-hand side goes to 

/ M + np-1 \ 

(np - 1) log h loge 

V np — 1 ) 

4 Use the following formula ([Q], p. 10), 

log Q = 61og I + (a - b) log ^_ + 1 log + o(l). 
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for n — > oo. Together with (|5|) this yields 

M = Fl(pn 1+ p). 

Therefore, the running time of the algorithm is as stated in the theorem for every permu- 
tation 7r satisfying (|2|). [] By lemma |] at least a (1 — l/ra)-fraction of all permutations it 
require that high complexity. Therefore, the following is a lower bound on the expected 
number of comparisons of the sorting procedure: 

(l_I )0 (pn 1+ i), 

where we can ignore the contribution of the remaining (l/n)-fraction of all permutations. 
This gives us the theorem. □ 



Corollary 1 We can do the same analysis for the number of comparisons. Denote the 
analogues of m,j & by m!- k and the analogue of M by M' . Note that rn' ik = mj Counting 
the number of comparisons we observe that every element is compared at least once in every 
pass. Therefore, all the fc 's are positive integers and the number D(M) = („„_]}■ A 

similar calculation yields that M' = Q,{pn l+l / p ) again. 



Compare our lower bound on the average-case with the Plaxton-Poonen-Suel Q(n 1+e ' v^) 
worst case lower bound. Some special cases of the lower bound on the average-case com- 
plexity are: 

1. When p = 1, this gives asymptotically tight bound for the average number of inversions 
for Insertion Sort. 

2. When p = 2, Shellsort requires f2(n 3//2 ) inversions (the tight bound is known to be 
9(n 5 / 3 ) 1); 



3. When p = 3, Shellsort requires f2(n 4//3 ) inversions ([16] gives an analysis but not a 
comparable asymptotic formula); 

4. When p = log n/ log log n, Shellsort requires f2(nlog 2 n/ log log n) inversions; 

5. When p = logra, Shellsort requires f2(nlogn) inversions. When we consider compar- 
isons, this is of course the lower bound of average number of comparisons for every 
sorting algorithm. 



Let us refine the argument by taking into account the different increments hi, . . . , h p of the different 
passes maximizing the contribution of every pass separately. Fix the number of inversions in pass k as 
M k < n 2 /h k (k := 1, . . . ,p). Replace M in (|l]), (||), and (||) by the vector (Mi, . . . , M p ). With p < logn 
encoding the Mi's self-delimiting takes at most p(logn 2 + 2 log log n 2 ) — 0(log 2 n) bits. If all Mi > n 
(1 < i < p), then we find similar to before 

Mi+n-l\ {M p + n-l\ , 



log ^ n _ x )"\ n-1 ) + °( log n ) - logn! - 41 °S^ + °( 1 )- 

Altogether this leads to log((Mi+n-l)/(n-l)) . . . ((M p +n-l)/(n-l)) = log n-0((log 2 n)/n) which by the 
inequality of arithmetic and geometric means ((£ Mi)/p > (n M0 1/p ) yields (Mi + . . . M p +p(n - l))/p > 
n 1+ p. Just like before we now obtain M — Mi + . . . + M p = Q(pn 1+ p). So we need some more subtle 
argument to improve the lower bound. 
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6. Whenp = log 2 n, Shellsort requires Q(n log n) inversions but it also requires 0(n log 2 n) 
comparisons. (The running time is known to be B(nlog 2 n) in this case 

7. In general, when p = p{n) > log n, Shellsort requires £l(n ■ p(n)) comparisons because 
every pass trivially makes n comparisons. 

In (l^l it is mentioned that the existence of an increment sequence yielding an average 
0{n log n) Shellsort has been open for 30 years. The above lower bound on the average 
shows that the number p of passes of such an increment sequence (if it exists) is precisely 
p = 0(logn); all the other possibilities are ruled out. 

Remark 1 It is a simple consequence of the Shellsort analysis to obtain average-case lower 
bounds on some other sorting methods. Here we use Bubble Sort as an example. In 
the next section, we analyze stack-sort and queue-sort. A description and average-case 
analysis of Bubble Sort can be found in ||. It is well-known that Bubble Sort uses B(n 2 ) 
comparisons/exchanges on the average. We present a very simple proof of this fact. The 
number of exchanges is obviously at most n 2 , so we only have to consider the lower bound. 
In Bubble Sort we make at most n — 1 passes from left to right over the permutation to 
be sorted and move the largest element we have currently found right by exchanges. For 
a permutation ir of the elements 1, . . . , re, we can describe the total number of exchanges 
by M := J27=i m i wn ere rrii is the initial distance of element n — i to its proper place 
n — i. Note that in every pass more than one element may "bubble" right but that means 
simply that in the future passes of the sorting process an equal number of exchanges will 
be saved for the element to reach its final position. That is, every element executes a 
number of exchanges going right that equals precisely the initial distance between its start 
position to its final position. An almost identical analysis as that of Theorem [I] shows that 
log M/n > logn + O(l) for every M. As before this holds for an overwhelming fraction 
of all permutations, and hence gives us an Q(re 2 ) lower bound on the expected number of 
comparisons /exchanges . 



3 Sorting with Queues and Stacks 

Knuth H and Tarjan |15| have studied the problem of sorting using a network of queues or 
stacks. In particular, the main variant of the problem is: assuming the stacks or queues are 
arranged sequentially as shown in Figure |l| or in parallel as shown in Figure then how 
many stacks or queues are needed to sort n numbers. Here, the input sequence is scanned 
from left to right and the elements follow the arrows to go to the next stack or queue or 
output. 

5g " O O O O O O Input Permutation 

A sequence of stacks or queues 
Figure 1: Six stacks/queues arranged in sequential order 

Tarjan [jljj] actually studied arbitrary acyclic networks of stacks and queues. Our tech- 
nique will in general apply there. However we will concentrate on dealing with the above 
two main variants, and concentrate on the average-case analysis. 
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stacks/queues 



Figure 2: Six stacks/queues arranged in parallel order 



3.1 Sorting with Sequential Stacks 

The sequential stack sorting problem is in || exercise 5.2.4-20. We have k stacks numbered 
So, ... , Sk-i- The input is a permutation it of the elements l,...,n. Initially we push the 
elements of tt on Sq at most one at a time in the order in which they appear in tt. At 
every step we can pop a stack (the popped elements will move left in Figure ||) or push 
an incoming element on a stack. The question is how many stack are needed for sorting 
tt. It is known that k = logn stacks suffice, and ^logn stacks are necessary in the worst- 
case §, ||. Here we prove that the same lower bound also holds on the average with a very 
simple incompressibility argument. 

Theorem 2 On the average, at least \ log n stacks are needed for sequential stack sort. 

Proof. Fix a random permutation tt such that 

C(7r\n,P) < logn! — log = nlogn — O(logn), 

where P is an encoding program to be specified in the following. 

Assume that k stacks is sufficient to sort tt. We now encode such a sorting process. For 
every stack, exactly n elements pass through it. Hence we need perform precisely n pushes 
and n pops on every stack. Encode a push as and a pop as 1. It is easy to prove that 
different permutations must have different push/pop sequences on at least one stack. Thus 
with 2kn bits, we can completely specify the input permutation tt. Then, as before, 

2kn > logn! — logn = nlogn — O(logn). 

Hence, approximately k > ^ logn for the random permutation tt. 

Since most permutations are random, we can calculate the average-case lower bound as: 

1, n-1 1 1 1, 

- log n h 1 • - ~ - log n. 

2 n n 2 

□ 



6 In fact since each stack corresponds to precisely n pushes and n pops where the pushes and pops form 
a "balanced" string, the Kolmogorov complexity of such a sequence is at most g(n) := 2n — | logn + O(l) 
bits. So 2kg(n) bits would suffice to specifiy the input permutation. But this does not help to nontrivially 
improve the bound. 



7 



3.2 Sorting with Parallel Stacks 

Clearly, the input sequence 2, 3, 4, . . . , n, 1 requires n — 1 parallel stacks to sort. Hence the 
worst-case complexity of sorting with parallel stacks, as shown in Figure ^, is n— 1. However, 
most sequences do not need this many stacks to sort in parallel arrangement. The next two 
theorems show that on the average, ®(y/n) stacks are both necessary and sufficient. Observe 
that the result is actually implied by the connection between sorting with parallel stacks 



and longest increasing subsequences given in [15] and the bounds on the length of longest 



increasing subsequences of random permutations given in, |||, |3|. However, the proofs 
in [|], H ||] use deep results from probability theory (such as Kingman's ergodic theorem) 
and are quite sophisticated. Here we give simple proofs using incompressibility arguments. 

Theorem 3 On the average, the number of parallel stacks needed to sort n elements is 

Proof. Consider a random permutation tt such that 

C(ir\n) > logra! — logn. 

We use the following trivial algorithm (which is described in ]|1|) to sort tt with stacks in 
the parallel arrangement as shown in Figure ||. Assume that the stacks are named Sq, S±, . . . 
and the input sequence is denoted as x\, . . . ,x n . 

Algorithm Parallel-Stack-Sort 

1. For i = 1 to n do 

Scan the stacks from left to right, and push x% on the the first stack Sj whose top 
element is larger than X{. If such a stack doesn't exist, put Xi on the first empty 
stack. 

2. Pop the stacks in the ascending order of their top elements. 

We claim that algorithm Parallel-Stack-Sort uses 0(y/n) stacks on the permutation tt. 
First, we observe that if the algorithm uses m stacks on tt then we can identify an increasing 



subsequence of tt of length m as in 15]. This can be done by a trivial backtracing starting 
from the top element of the last stack. Then we argue that tt cannot have an increasing 
subsequence of length longer than e^/n, where e is the natural constant, since it is logn- 
incompressible . 

Suppose that a is a longest increasing subsequence of tt and m = \a\ is the length of a. 
Then we can encode tt by specifying: 

1. a description of this encoding scheme in 0(1) bits; 

2. the number m in logm bits; 

3. the permutation a in log (") bits; 

4. the locations of the elements of a in the permutation tt in at most log (?) bits; and 

5. the remaining tt with the elements of a deleted in log(n — m)\ bits. 
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This takes a total of 

tl\ 

login — m)\ + 2 log — — + logm + 0(1) + 2 log log m 

m\[n — m)\ 

bits. Using Stirling approximation and the fact that y/n < m = o(n), we can simplify the 
above expression as: 

log(n — m)\ + 2 log — - — : — + logm + 0(1) + 2 log log m 

m\(n — m)\ 

(n/e) n 

< logn! + log v ; + O(logn) 

[m / e) zm {(n — m) / e) n m 

ft ft 

ps log n! + m log — 7T + (n — m) log h m log e + 0(log n) 

m 2 n — m 

logn! + m log — ^ + 2m log e + O(logra) 
Hence we have inequality 

Tl 

log n! + m log — k + 2m log e + 0(log n) > log n\ — log n 
m z 

which requires that (approximately) m < e^/n = 0(y/n). 

The average complexity of Parallel-Stack-Sort can be simply calculated as: 

77 — 1 1 

0(y/n) + n-- = 0(y/n). 

77 71 

□ 

Theorem 4 On i/ie average, the number of parallel stacks required to sort a permutation 
is Q,(\/n). 

PROOF. Let A be any sorting algorithm using parallel stacks. Fix a random permutation 
7r with C(n\n,P) > logn! — logn, where P is the program to do the encoding discussed 
in the following. Suppose that A uses T parallel stacks to sort ir. This sorting process 
involves a sequence of moves, and we can encode this sequence of moves by a sequence of 
the following terms: 

• push to stack i, 

• pop stack j, 

where the element to be pushed is the next unprocessed element from the input sequence 
and the popped element is written as the next output element. Each of these term requires 
logT bits. In total, we use 2n terms precisely since every element has to be pushed once 
and popped once. Such a sequence is unique for every permutation. 

Thus we have a description of an input sequence with length 2nlogT bits, which must 
exceed C(tt\u, P) > nlogn — O(logn). It follows that approximately T > sjn = £l(y/n). 

We can calculate the average-case complexity of A as: 

77—1 1 

to(Vf£) ■ + 1 • - = tt(y/n~). 

n n 

□ 
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3.3 Sorting with Parallel Queues 



It is easy to see that sorting cannot be done with a sequence of queues. So we consider the 
complexity of sorting with parallel queues. It turns out that all the result in the previous 
subsection also hold for queues. 

As noticed in [JD]] , the worst-case complexity of sorting with parallel queues is n since the 
input sequence n, n — 1, . . . , 1 requires n queues to sort. We show in the next two theorems 
that on the average, 6( v / n) queues are both necessary and sufficient. Again, the result 
is implied by the connection between sorting with parallel queues and longest decreasing 
subsequences given in [15] and the bounds in [f|, ||, ||| (with sophisticated proofs). Our 



proofs are almost trivial given the proofs in the previous subsection. 

Theorem 5 On the average, the number of parallel queues needed to sort n elements is 
upper bounded by 0(- v /n). 

Proof. The proof is very similar to the proof of Theorem |j| We use a slightly modified 



greedy algorithm as described in IB]: 

Algorithm Parallel-Queue-Sort 

1. For i = 1 to n do 

Scan the queues from left to right, and append X{ on the the first queue whose rear 
element is smaller than x%. If such a queue doesn't exist, put Xi on the first 
empty queue. 

2. Delete the front elements of the queues in the ascending order. 

Again, we can claim that algorithm Parallel-Queue-Sort uses 0(y/n) queues on any 
log n-incompressible permutation ir. We first observe that if the algorithm uses m queues 
on 7r then a decreasing subsequence of ir of length m can be identified, and we then argue 
that 7r cannot have a decreasing subsequence of length longer than ey/n, in a way analogous 
to the argument in the proof of Theorem [|. □ 

Theorem 6 On the average, the number of parallel queues required to sort a permutation 
is £l(y/n). 

Proof. The proof is the same as the one for Theorem |] except that we should replace 
"push" with "enqueue" and "pop" with "dequeue". □ 



4 Open Questions 

We have shown that the incompressibility method is a quite useful tool for analyzing average- 
case complexity of sorting algorithms. Simplicity has been our goal. All the proofs and 
methodology presented here can be easily grasped and applied, as also demonstrated in |j|], 
and they can be easily taught in the classrooms to undergraduate students. 

The average-case performance of Shellsort has been one of the most fundamental and 
interesting open problems in the area of algorithm analysis. The simple average-case analysis 
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we made for Insertion Sort (1-pass Shellsort), Bubble Sort, stack-sort and queue-sort are 
for the purpose of demonstrating the generality and simplicity of our technique in analyzing 
many sorting algorithms. Several questions remain, such as: 

1. Prove tight average-case lower bound for Shellsort. Our bound is not tight for p = 2 
passes. 

2. For sorting with sequential stacks, can we close the gap between logn upper bound 
and the \ log n lower bound? 

5 Acknowledgements 

We thank I. Munro, V. Pratt, and R. Sedgewick for telling us many things about Shellsort and 
stack/queue sorts. 

References 

[1] H. Buhrman, T. Jiang, M. Li, and P. Vitanyi, New applications of the incompressibility method, 
submitted to ICALP'99. 

[2] J. Incerpi and R. Sedgewick, Improved upper bounds on Shellsort, Journal of Computer and 
System Sciences, 31(1985), 210-224. 

[3] S.V. Kerov and A.M. Versik, Asymptotics of the Planchcrel measure on symmetric group and 
the limiting form of the Young tableaux, Soviet Math. Dokl. 18 (1977), 527-531. 

[4] J.F.C. Kingman, The crgodic theory of subadditive stochastic processes, Ann. Probab. 1 (1973), 
883-909. 

[5] A.N. Kolmogorov, Three approaches to the quantitative definition of information. Problems 
Inform. Transmission, 1:1(1965), 1-7. 

[6] D.E. Knuth, The Art of Computer Programming, Vol.3: Sorting and Searching, Addison- 
Wesley, 1973 (1st Edition), 1998 (2nd Edition). 

[7] M. Li and P.M.B. Vitanyi, An Introduction to Kolmogorov Complexity and its Applications, 
Springer- Verlag, New York, 2nd Edition, 1997. 

[8] B.F. Logan and L.A. Shcpp, A variational problem for random Young tableaux, Advances in 
Math. 26 (1977), 206-222. 

[9] A. Papcrnov and G. Stasevich, A method for information sorting in computer memories, Prob- 
lems Inform. Transmission, 1:3(1965), 63-75. 

[10] C.G. Plaxton, B. Pooncn and T. Suel, Improved lower bounds for Shellsort, Proc. 33rd IEEE 
Symp. Foundat. Comput. ScL, pp. 226-235, 1992. 

[11] V.R. Pratt, Shellsort and Sorting Networks, Ph.D. Thesis, Stanford University, 1972. 

[12] R. Sedgewick, Analysis of Shellsort and related algorithms, presented at the Fourth Annual 
European Symposium on Algorithms, Barcelona, September, 1996. 

[13] R. Sedgewick, Open problems in the analysis of sorting and searching algorithms, Presented at 
Workshop on the Probabilistic Analysis of Algorithms, Princeton, May, 1997. 

[14] D.L. Shell, A high-speed sorting procedure, Commun. ACM, 2:7(1959), 30-32. 

[15] R.E. Tarjan, Sorting using networks of queues and stacks, Journal of the ACM, 19(1972), 341- 
346. 

[16] A.C.C. Yao, An analysis of (h, k, l)-Shellsort, Journal of Algorithms, 1(1980), 14-50. 



11 



